A Uniform Approach toward Nested Parallelism

نویسندگان

  • Kazuhiko Kakehi
  • Kiminori Matsuzaki
  • Zhenjiang Hu
  • Masato Takeichi
چکیده

This paper develops a uniform approach toward such nested structures through converting these data structures into lists of tuples with depth information. Generic parallel computation schemes like reduce and accumulation are analyzed using the parallelization techniques over lists. We demonstrate its expressiveness and efficacy using a class of optimization problems called maximum marking problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Optimized Three Region Partitioning Technique to Maximize Parallelism of Nested Loops With Non-uniform Dependences

There are many methods for nested loop partitioning exist; however, most of them perform poorly when they partition loops with non-uniform dependences. This paper proposes a generalized and optimized loop partitioning mechanism which can exploit parallelism in nested loops with non-uniform dependences. Our approach based on the region partitioning technique divides the loop into variable size p...

متن کامل

Asynchronous Nested Parallelism for Dynamic Applications in Distributed Memory

Nested parallelism is of increasing interest for both expressivity and performance. Many problems are naturally expressed with this divide-and-conquer software design approach. In addition, programmers with target architecture knowledge employ nested parallelism for performance, imposing a hierarchy in the application to increase locality and resource utilization, often at the cost of implement...

متن کامل

Finding Synchronization-Free Slices of Operations in Arbitrarily Nested Loops

This paper presents a new approach for extracting synchronizationfree parallelism being represented by dependent statement instances of an arbitrarily nested loop. Presented algorithms can be applied to both uniform and non-uniform loops. The main advantage is that more synchronization-free parallelism may be extracted than that yielded by existing techniques. Our approach, based on operations ...

متن کامل

A Release Combined Scheduling Scheme for Non-Uniform Dependence Loops

In general, synchronization mechanisms can be used to preserve dependence constraints in any nested loop, and can be combined with a loop scheduling scheme to form a uniform framework to obtain the correct execution order and balance workload distribution. Most current scheduling mechanisms cannot handle non-uniform dependence loops. In this paper, we propose a new combined scheduling scheme ca...

متن کامل

Full Parallelism in Uniform Nested Loops Using Multi-Dimensional Retiming

Most scientific and DSP applications are recursive or iterative. Uniform nested loops can be modeled as multi-dimensional data flow graphs (DFGs). To achieve full parallelism of the loop body, i.e., all the computational nodes executed in parallel, substantially decreases the overall computation time. It is well known that for onedimensional DFGs retiming can not always achieve full parallelism...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004